Database for detecting counterfeit items using digital fingerprint records

ABSTRACT

Improvements are disclosed for detecting counterfeit objects, based on comparison to digital fingerprints that describe features found in images of objects known to be counterfeit.

RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 15/208,339, filed Jul. 12, 2016, which is a divisional of U.S. application Ser. No. 14/531,724, filed Nov. 3, 2014 (now U.S. Pat. No. 9,443,298), which is a non-provisional, pursuant to 35 U.S.C. § 119(e), of U.S. provisional application No. 61/914,722 filed Dec. 11, 2013 and U.S. provisional application No. 61/898,780 filed Nov. 1, 2013, both incorporated herein by this reference. Application Ser. No. 14/531,724 is a continuation-in-part of U.S. application Ser. No. 14/290,653 filed May 29, 2014 (now U.S. Pat. No. 9,350,552), which is a continuation of U.S. application Ser. No. 13/410,753 filed Mar. 2, 2012 (now U.S. Pat. No. 8,774,455), which is a non-provisional, pursuant to 35 U.S.C. § 119(e), of U.S. provisional application No. 61/448,465 filed Mar. 2, 2011, each of which are incorporated herein by reference.

COPYRIGHT NOTICE

©2018 Alitheon, Inc. A portion of the disclosure of this patent document contains material which is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever. 37 CFR § 1.71(d).

TECHNICAL FIELD

This invention pertains to methods and apparatus to identify and/or authenticate physical items, including documents, and more specifically to using a secure database of digital fingerprint records to detect a counterfeit item.

BACKGROUND OF THE INVENTION

Counterfeiting of manufactured goods is a worldwide problem, with recent studies estimating that 8% of the world's total GDP is now generated by the manufacturing and sales of counterfeit products. Many classes of counterfeit goods create substantial risks to public health including counterfeit pharmaceutical drugs, auto parts, pesticides, and children's toys. In addition, counterfeit computer chips, aerospace parts, and identification documents present significant risks to national security.

Many different approaches have been tried to uniquely identify and authenticate objects, including serial numbers, bar codes, holographic labels, RFID tags, and hidden patterns using security inks or special fibers. All of these methods can be duplicated, and many add a substantial extra cost to the production of the goods being protected. In addition, physically marking certain objects such as artwork, gemstones, and collector-grade coins can damage or destroy the value of the object.

If identifying or certifying information is stored separately from the object in the form of a label, tag, or certificate the entire identification/certification process must typically be performed again if the object is lost and later recovered, or its chain of control is otherwise compromised. There is a need for solutions that can prove the provenance of an object once the chain of custody is disrupted by the removal of the object from safe custody and/or the loss of the associated identification or certification information.

Other known techniques call for comparing bitmaps of images of the objects themselves, or selected regions of interest. Referring now to FIG. 8, the image of the original object is taken and stored for reference. The whole image is stored, although it may be compressed for efficiency. When a new object is encountered, an image is taken of the new object and directly compared to the original image using XOR or similar algorithms. If there are no (or only statistically insignificant) differences, the images are declared a match and the object is authenticated. Further, FFT or similar transforms may be used to generate a “digital signature” of the image that can be used for comparison. See FIG. 9. However, as in the previous case the same method is used—the resultant bitmapped image is compared with another bitmapped image, and if the pixels match the object is authenticated. Such methods are disclosed in U.S. Pat. No. 7,680,306 to Boutant et al. Bitmapped techniques are inefficient due to issues like file size, and have serious limitations that make them effectively unusable in most real world applications, due to variable lighting and orientation of the images, and the authentication of worn, damaged or otherwise altered objects.

SUMMARY OF THE INVENTION

The following is a summary of the invention in order to provide a basic understanding of some aspects of the invention. This summary is not intended to identify key/critical elements of the invention or to delineate the scope of the invention. Its sole purpose is to present some concepts of the invention in a simplified form as a prelude to the more detailed description that is presented later.

A physical object is scanned and a digital image of the object is created from the scan. A subset of the image known as an “authentication region” is selected. A set of features is extracted from the authentication region, which is sufficient to create a unique identifier or “digital fingerprint” for that object. The digital fingerprint may be registered in a database.

To select locations in an image to extract fingerprint features, a software process automatically selects a large number—typically hundreds or even thousands per square mm—of preferred areas of interest for purposes of digital fingerprint. A location may be of interest because of a relatively high level of content. That “content” in a preferred embodiment may comprise a gradient or vector, including a change in value and a direction.

In a preferred embodiment, each such area of interest is identified as a circle, for example, by centroid location and radius. Within each circular area of interest, the software then extracts one or more fingerprint features that define the relevant shapes within the corresponding circular location of the image. Each fingerprint feature preferably is stored as a feature vector as illustrated below. A feature vector preferably is an array of integer or floating point values describing an individual shape.

When an object is to be authenticated, a suitable system compares the digital fingerprint of the object to digital fingerprints previously stored in the database, and based on that comparison determines whether the object has been registered before, and is thus authentic. The digital fingerprint data specifies a set of features. Preferably, an “object feature template” may be created which has a list of specific features and attributes that are relevant for authenticating a particular class of objects. A template may identify locations of particular features. One of the key advantages of the feature based method is that when the object is very worn from handling or use, the system can still identify the object as original, may be impossible with the bitmapped approach.

Another aspect of this disclosure relates to detecting a counterfeit or forged object, for example a document such as a drivers license or passport. In this case, there may be no “original” or source object digital fingerprint for comparison. Rather, “fingerprints” of known indicia of counterfeit or forged objects can be acquired and stored. For example, a large number of counterfeit New York State driver's licenses might be obtained by law enforcement officials in a raid or the like. Digital images of those forged documents can be acquired, and analyzed to form digital fingerprints, as described in more detail below. “Forgery feature vectors” of typical features that occur in the counterfeit licenses can be collected and stored in a database. Such indicia may include, for example, sharp, non-bleeding edges where a photograph has been replaced or torn paper fibers where an erasure occurred. These stored features from the counterfeit licenses can then be analyzed and stored as a reference set of fraudulent methods which can then be compared to new license fingerprints to detect a forged document. A count of “fraud indicator matches” can be compared to an empirical threshold to determine and quantify a confidence that a document is forged (or not).

Further, the fingerprinting approach described below can be used to determine whether a manufactured object meets its manufactured specifications. Applications of the system include but are not limited to object authentication, anti-counterfeiting, determining the provenance of an object, and compliance with manufacturing specifications.

Additional aspects and advantages of this invention will be apparent from the following detailed description of preferred embodiments, which proceeds with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is an example of an authentication region and object feature template definition for a U.S. passport.

FIG. 2 is a simplified flow diagram of a process for digital fingerprint generation and registration.

FIG. 3 is a simplified flow diagram of a process for authentication of a previously fingerprinted object.

FIG. 4 is a simplified flow diagram illustrating a process for object inspection to detect evidence of counterfeits.

FIG. 5 is a simplified flow diagram illustrating an object manufacturing inspection process.

FIG. 6 is a simplified flow diagram illustrating a method for building a database for use in detecting forged or altered documents.

FIG. 7 is a simplified flow diagram illustrating a method for using a digital fingerprint of a suspect document to detect a potential forger or alteration associated with the document.

FIG. 8 is a simplified diagram of a prior art bitmap comparison method for comparing images.

FIG. 9 is an example of a photograph and an image created by Fast Fourier Transform (FFT) of the image data.

FIG. 10 is a simple illustration of fingerprint feature extraction from an original digital image.

FIG. 11 is a simple illustration of fingerprint feature extraction from a comparison or candidate image.

FIG. 12 is a simple illustration of fingerprint feature comparison for identifying or authenticating an object.

FIG. 13A shows an image of the numeral “3” representing the first digit in a serial number of an “original” or known U.S. dollar bill.

FIG. 13B shows an image of the numeral “3” representing the first digit in a serial number a U.S. dollar bill to be authenticated.

FIG. 14A is an illustration of results of feature extraction showing selected areas of interest in the image of FIG. 13A.

FIG. 14B is an illustration of results of feature extraction sowing selected areas of interest in the image of FIG. 13B.

FIG. 15A shows the same dollar bill image as in FIG. 13A, juxtaposed with FIG. 15B for comparison.

FIG. 15B shows an image of the numeral “3” that has been damaged or degraded.

FIG. 16A shows detail of two fingerprint feature locations on the numeral 3.

FIG. 16B shows detail of the damaged bill with the corresponding fingerprint feature locations called out for comparison.

FIG. 17 is a simplified illustration of a rotational transformation in the process of comparing digital fingerprints of two images.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

The methods described in this disclosure enable the identification of objects without attaching or associating any physical tags or materials with the object. A system does this by creating a unique digital signature for the object, which is referred to as a digital fingerprint. Digital fingerprinting utilizes the natural structure of the object, or essentially random features created incidental to the manufacturing process, to generate a unique digital signature for that object, much like a human fingerprint. Also like a human fingerprint, the digital fingerprint can be stored and retrieved to identify objects when they are encountered at a later date.

Eliminating the need to add tags or any physical modifications to the object offers a number of advantages to manufacturers, distributors, sellers and owners of goods. It reduces the cost of manufacturing, and is more secure than physical tagging. Physical tags may be lost, modified, stolen, duplicated, or counterfeited; digital fingerprints cannot.

Unlike prior art approaches that simply utilize a comparison of pixels, a system in accordance with this disclosure utilizes the extraction of features to identify and authenticate objects. Feature extraction enables us to take a large amount of information and reduce it to a smaller set of data points that can be processed more efficiently. For example, a large digital image that contains tens of thousands of pixels may be reduced to just a few features that can effectively identify the object. This reduced set of data we call a digital fingerprint. This digital fingerprint contains a set of individual fingerprint features which are stored as feature vectors. These vectors make image processing more efficient and reduce storage requirements, as the entire image need not be stored in the database, only the feature vectors. Examples of feature extraction algorithms include but are not limited to edge detection, corner detection, blob detection, wavelet features; Gabor, gradient and steerable output filter histograms, scale-invariant feature transformation, active contours, shape contexts and parameterized shapes.

While the most common applications of our system may be in the authentication of manufactured goods and documents, the system is designed to be applicable to any object that can be identified, characterized, quality tested, or authenticated with a digital fingerprint. These include but are not limited to mail pieces, parcels, art, coins, currency, precious metals, gems, jewelry, apparel, mechanical parts, consumer goods, integrated circuits, firearms, pharmaceuticals and food and beverages. Here we use the term “system” in a broad sense, including our methods as well as apparatus arranged to implement such methods.

Scanning

In an embodiment, an object is scanned and identified either at initial manufacture or at the time of first contact with the system. This point of identification is preferably done when the item is either in the possession of its manufacturer, or has been transferred by secure means to the current holder so that its legitimacy at point of identification is adequately established. When such a process is impossible, as in the example of artworks or old coins, the object may be fingerprinted after the object is authenticated by an expert while its provenance is still secure.

In this application, we use the term “scan” in a broad sense. We refer to any means for capturing an image or set of images, which may be in digital form or transformed into digital form. The images may be two dimensional, three dimensional, or be in the form of a video. Thus a “scan” may refer to an image (or digital data that defines an image) captured by a scanner, a camera, a specially-adapted sensor array such as CCD array, a microscope, a smart phone camera, a video camera, an x-ray machine, etc. Broadly, any device that can sense and capture electromagnetic radiation that has traveled through an object, or reflected off of an object, is a candidate to create a “scan” of the object. Other means to extract “fingerprints” or features from an object may be used; for example, through sound, physical structure, chemical composition, or many others. The remainder of this application will use terms like “image” but when doing so, the broader uses of this technology should be implied. In other words, alternative means to extract “fingerprints” or features from an object should be considered equivalents within the scope of this disclosure.

Authentication Regions

Because the system works with many different types of objects, it is necessary to define what parts of the digital images of the objects are to be used for the extraction of features for authentication purposes. This can vary widely for different classes of objects. In some cases it is the image of the entire object; in other cases it will be a specific sub-region of the image of the object.

For instance, for a photograph we may want to use the digital image of the entire photograph for feature extraction. Each photograph is different, and there may be unique feature information anywhere in the photograph. So in this case, the authentication region will be the entire photograph.

Multiple regions may be used for fingerprints for several reasons, two of which are particularly important. It may be that there are several regions where significant variations take place among different similar objects that need to be distinguished while, in the same objects, there may be regions of little significance, i.e., in which there is little or no variation among different objects. In that case, the authentication region is used primarily to eliminate regions of little interest.

A bank note, for example, has a sufficient number of unique features that it can be authenticated if a few small arbitrary regions scattered across the surface are fingerprinted, along with recognizing the contents of a region telling the value of the bank note and one containing the bank note's serial number. In such a case the fingerprints of any region (along with sufficient additional information to determine the bank note's value and its purported identity) may be sufficient to establish the authenticity of the bill and multiple fingerprinted regions are used solely in the event that one or more regions may be absent (through, for example, tearing) when the bill is later presented for authentication.

Sometimes, however, specific regions of an item must be authenticated to ensure the item is both authentic and has not been altered. A passport provides an example. On a passport the features preferably used for authentication are extracted from regions containing such specific identification information as the passport number, recipient name, and recipient photo. In that case, we define a template of all those regions whose alteration from the original would invalidate the passport, such regions including the passport holder's photo and unique personal data.

FIG. 1 illustrates an example of an authentication region and object feature template definition for a U.S. passport. In this figure, brace 100 refers to a simplified flow diagram of a process as follows. At process block 102, an object is scanned to generate an original “image”—technically a digital data file in any suitable format. We will simply refer to this data as an image. The original image is illustrated as the front page of a U.S. passport 150. Next, the system processes the image data to determine an authentication region. For example, here the authentication region is the lower portion of image 150, identified by dashed box 154. Next the process generates an authentication image for feature extraction, block 106. The image is illustrated at reference 156. Next, at block 108, the process defines one or more features for extraction. These are shown in the image 158 by dashed boxes 160, for example, surname, given name, and passport number regions.

Finally, at block 110, the process 100 comprises creating a feature template 120. In this example, template 120 identifies an object class (U.S. Passport), defines an authentication regions (for example, by X-Y coordinates), and it lists one or more features within that authentication region. Here, the list comprises passport number, photo, first name and last name.

The ability to define and store the optimal authentication region for a given class of objects offers significant benefits to the user. In many cases it is much easier to scan a limited region of an object than the entire object. For instance, in the case of an article of designer clothing, it is much easier to take a picture of the manufacturer's label than it is to take a picture of the entire garment. Further, defining such regions enable the detection of partial alteration of the object.

Once an authentication region is defined, specific applications can be created for different markets and classes of objects that can assist the user in locating and scanning the optimal authentication region. For instance, an appropriately sized location box and crosshairs can automatically appear in the viewfinder of a smartphone camera application to help the user center the camera on the authentication region, and automatically lock onto the region and take the picture when the camera is focused on the correct area.

In many cases, objects may have permanent labels or other identifying information attached to them. These can also be used as features. For instance, wine may be put into a glass bottle and a label affixed to the bottle. Since it is possible for a label to be removed and reused, simply using the label itself as the authentication region is often not sufficient. In this case we may define the authentication region to include both the label and the substrate it is attached to—in this case some portion of the glass bottle. This “label and substrate” approach may be useful in defining authentication regions for many types of objects, such as consumer goods and pharmaceutical packaging. If a label has been moved from it's original position, this can be an indication of tampering or counterfeiting. If the object has “tamper-proof” packaging, this may also be useful to include in the authentication region.

In some cases, we will want to use multiple authentication regions to extract unique features. For a firearm, for example, we might extract features from two different parts of the weapon. It is, of course, important that both match the original, but since the two parts may both have been taken from the original weapon and affixed to a weapon of substandard quality, it may also be important to determine whether their relative positions have changed as well. In other words it may be necessary to determine that the distance (or other characteristic) between Part A's authentication region and Part B's authentication region is effectively unchanged, and only if that is accomplished can the weapon be authenticated.

Object Feature Template Definition

When a new type or class of object is being scanned into the system for the first time, the system can create an Object Feature Template (as shown in FIG. 1) that can be used to optimize subsequent authentication operations for that class of objects. This template can either be created automatically by the system, or by using a human-assisted process.

An Object Feature Template is not required for the system to authenticate an object, as the system can automatically extract features and create a digital fingerprint of an object without it. However, the presence of a template can greatly optimize the authentication process and add additional functionality to the system.

TABLE 1 Example Object Feature Template. CLASS: [Description of the object] United States Passport AUTHENTICATION REGION: [Description of the authentication regions for the object] Region 1: (x1, y1, z1), (x2, y2, z2) . . Region n REGION MATCH LIST [List of the regions that are required to match to identify an object] Region List: 1..n FEATURES: [Key features of the object]  Feature 1: Passport Number Feature 2: Photo Feature 3: First Name Feature 4: Last Name . . Feature n METHODS: [Programs that can be run on features of an object] Feature 2: Photo Method 1: [checkphoto.exe] Check for uneven edges indicating photo substitution . . Method n Feature n Method n ADDITIONAL DATA [Additional data associated with the object] Data 1: example data . . Data n

The uses of the Object Feature Template include but are not limited to determining the regions of interest on the object, the methods of extracting fingerprinting and other information from those regions of interest, and methods for comparing such features at different points in time. The name “object feature template” is not important; other data with similar functionality (but a different moniker) should be considered equivalent.

Four different but related uses for this technology are particularly in view in this disclosure. These are illustrative but are not intended to be limiting of the scope of the disclosure. These applications may be classified broadly as (1) authentication of a previously scanned original, (2) detection of alteration of a previously scanned original, (3) detection of a counterfeit object without benefit of an original, and (4) determination whether a manufactured item is within manufacturing or other applicable specification.

In case (1), the object is fingerprinted during the creation process (or while its provenance is unquestioned), or at the point where an expert has determined its authenticity and then the object is later re-fingerprinted, and the two sets of fingerprints are compared to establish authenticity of the object. This may be done by extracting a single fingerprint from the entire object or by extracting multiple sets of features from different authentication regions. It may also be facilitated by reading or otherwise detecting a serial number or other identifying characteristic of the object using optical character recognition or other means to make determining which original to compare it with easier. In many cases, manufacturing databases use serial numbers as identifiers. If we know the serial number we can directly access the database record for the object, and can directly compare the digital fingerprint to the original that was stored during the creation process, rather than searching the entire digital fingerprinting database for a match.

In case (2), the object is compared region by region with the original looking for low or nonexistent match of the fingerprint features from those regions. While case (1) is designed to determine whether the original object is now present, this case (2) is to determine whether the original object has been modified and if so, detecting how. In some embodiments, regions of interest having poor or no matching fingerprint features are presumed to have been altered.

In case (3), the item may not have been fingerprinted while its provenance was secure. An example would be legacy bills or passports created prior to initiating the use of a digital fingerprinting system during the creation process. In this case, the fingerprints of regions of interest may be compared with fingerprints from examples of known counterfeit objects, or with both those and fingerprints of known good objects. As an example, if a photo is added to a passport, the edge of the photo is liable to be sharper than the edge of the original, unaltered photo, indicating a cut and paste operation. Fingerprint characteristics of known good passports and those of passports known to have been altered by changing the photograph can be compared with the passport being inspected to determine whether it shows features of alteration.

FIG. 6 is a simplified flow diagram of a process 600 for building a database for use in detecting counterfeit (forged or altered) objects. At process block 602, digital image data is acquired of a known forged or altered document. Next, we extract features from the image data, as discussed above, block 604. Continuing, at block 606 a digital fingerprint is created based on the extracted features.

The digital fingerprint data is stored in a database record, block 608. Further, the record (digital fingerprint) is designated in the database as having features tending to evidence a forgery, block 610. The basic process may be repeated, loop 650, to acquire more images, and more features, to build the database.

Returning to case (4), the question of authenticity or alteration is not at issue. Instead we use the fingerprinting process to determine whether an object was manufactured sufficiently close to the manufacturing specification. In this case comparison of fingerprint features is against the ideal features of a presumed-perfect object, referred to as the “reference object”. The reference object may exist (e.g. be one or more examples of the object that has been inspected by hand and declared good enough to serve as a standard) or may be a programmatic ideal. In this latter case the “ideal” fingerprint features will be generated manually or by a program rather than scanned off an original.

The Object Feature Template can contain a variety of information related to that class of objects. For instance, it would typically include the authentication region(s) for that class of objects, which authentication regions are required to determine a match, and a list of key features that are typically used in authenticating that object.

Additionally, a template can define methods to be applied to features that can be used to examine an object for signs of unauthorized modification or counterfeiting. For instance, every time a passport is scanned into the system, a program can automatically be run to examine the passport photo for signs of alteration. If the passport was fingerprinted at creation, fingerprints extracted from each such region at creation will be compared to fingerprints from corresponding regions when the passport is presented for authentication. If the passport was not fingerprinted at creation, the region template can be used, for example, to look for sharp, non-bleeding edges that can indicate where a photograph has been replaced or torn paper fibers can indicate where an erasure occurred. In addition to the examples discussed above, the Object Feature Template is designed to be extensible, and can store any additional data that is related to the object.

Digital Fingerprint Generation

Once an object has been scanned and at least one authentication region has been identified, the final digital image that will be used to create the unique digital fingerprint for the object is created. This image (or set of images) will provide the source information for the feature extraction process.

A “digital fingerprinting feature” is a feature of the object that is innate to the object itself, a result of the manufacturing process, a result of external processes, or of any other random or pseudo random process. For example, gemstones have a crystal pattern which provides an identifying feature set. Every gemstone is unique and every gem stone has a series of random flaws in its crystal structure. This crystal pattern may be used to generate feature vectors for identification and authentication.

A “feature” in this description is typically not concerned with reading or recognizing meaningful content by using methods like OCR (optical character recognition). For example, a label on a scanned object with a printed serial number may give rise to various features in fingerprint processing, some of which may become part of a digital fingerprint feature set or vector that is associated with the object. The features may refer to light and dark areas, locations, spacing, ink blobs, etc. This information may refer to the printed serial number on the label, but in the normal course of feature extraction during the fingerprinting process there is no effort to actually “read” or recognize the printed serial number.

As part of identifying the object, however, for ease of comparison of fingerprint features with those of the original which are stored in the object database, such information may in fact be read and stored by utilizing such techniques as optical character recognition. In many cases, serial numbers may be used as the primary index into a manufacturer's database, which may also contain the digital fingerprints. It would be far faster, for example, to determine whether a bank note being inspected is a match with a particular original if we can use the serial number, say “A93188871A” as an index into the digital fingerprinting database, rather than trying to determine which one it matches by iterating through many thousands of fingerprints. In this case (and in similar cases of weapon and passport serial numbers), the index recognition speeds up the comparison process but is not essential to it.

Once a suitable digital fingerprint of an object is generated, it may be stored or “registered” in a database. For example, in some embodiments, the digital fingerprint may comprise one or more fingerprint features which are stored as feature vectors. The database should be secure. In some embodiments, a unique ID such as a serial number also may be assigned to an object. An ID may be a convenient index in some applications. However, it is not essential, as a digital fingerprint itself can serve as a key for searching a database. In other words, by identifying an object by the unique features and characteristics of the object itself, arbitrary identifiers, labels, tags, etc. are unnecessary.

FIG. 2 is a simplified flow diagram of a process 200 for digital fingerprint generation and registration. In this case, the process begins with scanning the object, block 202. An image 250 is acquired, in this illustration an U.S. passport is used. The next step is to identify or generate an authentication region, block 204. For example, the authentication region may be the portion 252. The authentication region may be identified, as discussed above, from an object feature template (see Table 1). Next an object class of the object is determined, block 206. The result is used to check a database for a corresponding object class template, decision 208. If there is no matching template, the process proceeds to extract features, block 210 without the aid of a template. A digital fingerprint is created based on the extracted feature, block 212, and that digital fingerprint is stored in an object database 220.

Alternatively, if a matching object class template is found at decision 208, the process continues to extract features, block 222, utilizing the class template 223 to identify and locate the features. Then, a digital fingerprint is created from the resulting feature data, block 224, and stored in a database 230.

Authentication and Inspection Processes

When an object is presented, it is scanned and an image is generated. At that point, the steps to be followed depend on the operation to be performed. Several illustrative cases are discussed below.

Case #1: For authentication of a previously fingerprinted object, the following steps may be followed (see FIG. 3, discussed below):

-   -   1. The authentication region (or regions) is either determined         automatically by the system, or by utilizing the authentication         region definitions stored in an Object Feature Template.     -   2. The relevant features are extracted from the authentication         region(s) and the digital fingerprint is created. This will         typically be in the form of feature vectors, but other data         structures may be used as appropriate.     -   3. Optionally, a unique identifier such as a serial number may         be extracted and stored to augment subsequent search and         identification functions.     -   4. The digital fingerprint of the object to be authenticated is         compared to the digital fingerprints stored in the database.     -   5. The system reports whether the object is authentic; i.e.         whether it matches one of the digital fingerprints stored in the         database.     -   6. The system may then store the digital fingerprint of the         object to be authenticated in the database along with the         results of the authentication process. Normally only the         extracted features will be stored in the database, but the         authentication image and/or the original image may be stored in         the database for archival or audit purposes.

FIG. 3 illustrates such a process 300 in diagrammatic form. Beginning at start block 302, the process scans an object and creates an authentication image, block 304. The image is represented at 350, again using the passport example. Features are extracted, block 306, and optionally a serial number or similar ID number, preferably unique, may be extracted as well, block 310.

The extracted data is processed to create a digital fingerprint, bloc 312. An object database 320 may be queried for a matching fingerprint, block 314. A “match” may be defined by a probability or similarity metric. Results of the database query may be reported to a user, block 322. Finally, a new digital fingerprint may be stored into the database 320, shown at process block 330.

Case #2: For inspection of specific features of a previously fingerprinted object to determine whether they have been altered, the steps are similar to Case #1, but the process is used for the detection of alterations rather than authentication of the object:

-   -   1. The authentication region (or regions) is either determined         automatically by the system, or by utilizing the authentication         region definitions stored in an Object Feature Template.     -   2. The features to be inspected are extracted from the         authentication region and the digital fingerprint is created.         This will typically be in the form of feature vectors for the         features to be inspected but other data structures may be used         as appropriate.     -   3. Optionally, a unique identifier such as a serial number may         be extracted and stored to be used to augment subsequent search         and identification functions.     -   4. The digital fingerprint of the features to be inspected for         alteration is compared to the fingerprint of the corresponding         features from the original object stored in the database.     -   5. The system reports whether the object has been altered; i.e.         whether the digital fingerprint of the features to be inspected         match those previously stored in the database from the original         object.     -   6. The system may then store the digital fingerprint of the         features to be inspected in the database along with the results         of the inspection process. Normally only the features will be         stored in the database, but the authentication image and/or the         original image may be stored in the database for archival or         audit purposes.

Case #3: For inspection of the specific features of an object that has not been previously fingerprinted to determine whether the features have been altered, the following steps may be followed, referring now to FIG. 4.

The system scans the object, block 404, and creates an authentication image 450 that includes at least one authentication region. The authentication region (or regions) may be determined automatically by the system, or by utilizing the authentication region definitions defined in a stored Object Feature Template 406 as noted earlier. Either way, the process next extracts features from the authentication region(s), block 408, and a digital fingerprint is created. This will typically be in the form of feature vectors, but other data structures may be used as appropriate.

The features of the object are then analyzed, block 420, and examined for attributes indicative of a counterfeit, block 402. Methods may be applied to the features by running programs that are listed in the Object Feature Template that check features for signs of counterfeiting. Features can also be statistically compared to features of other objects of the same class that are stored in the database using Bayesian algorithms or other methods to find suspect variations that the standard methods may not catch. Optionally, a serial number or similar ID may be extracted, block 410.

The system preferably reports whether the object shows signs of alteration or counterfeiting, block 422. The system may then store the digital fingerprint of the object to be inspected, block 424, in the database 430 along with the results of the inspection process. Normally only the extracted features will be stored in the database, but the authentication image and/or the original image may be stored in the database for archival or audit purposes.

Case #4: For inspection of an object to determine whether it was manufactured in conformance with the manufacturer's specification, the following steps are followed; referring now to FIG. 5. The authentication region (or regions) for an object 502 is determined by utilizing the authentication region definitions stored in an Object Feature Template 506. In this illustration, the object 502 is a U.S. $100 bill. Scanning and creation of an authentication image are indicated at process block 504.

The manufacturing features are extracted from the regions of interest, block 508, and the digital fingerprint is created (not shown). This will typically be in the form of feature vectors for the manufacturing features, but other data structures may be used as appropriate. Optionally, a unique identifier such as a serial number may be extracted, block 510, and stored to be used to augment subsequent search and identification functions.

Next, the digital fingerprint of the manufacturing features of the object to be checked is are analyzed, block 520, and compared to a fingerprint of the manufacturing features from a reference object (i.e., a perfect manufactured object) stored in the database, illustrated at block 521. In other words, in some embodiments, a reference object may be “fingerprinted” and used as a proxy for manufacture specifications. In other cases, the digital fingerprint of the object, and more specifically the extracted feature vectors, may be compared to reference feature vectors that are based on manufacture specifications. This type of comparison speaks to quality of the object, but may not indicate provenance.

The system reports, block 522, whether the manufactured object meets specifications; i.e. whether the digital fingerprint of the manufacturing features sufficiently match those stored in the database from the reference object. The system may then store the digital fingerprint of the manufacturing features in the database 530, process block 524, along with the results of the manufacturing inspection process. Normally only the extracted manufacturing features will be stored in the database, but the manufacturing inspection image and/or the original image may be stored in the database for archival or audit purposes.

Because in all of the above cases we may be extracting features from images produced under variable lighting conditions, it is highly unlikely two different “reads” will produce the exact same digital fingerprint. In a preferred embodiment, the system is arranged to look up and match items in the database when there is a “near miss.” For example, two feature vectors [0, 1, 5, 5, 6, 8] and [0, 1, 6, 5, 6, 8] are not identical but by applying an appropriate difference metric the system can determine that they are close enough to say that they are from the same item that has been seen before. One example is to calculate Euclidean distance between the two vectors in multi-dimensional space, and compare the result to a threshold value. This is similar to the analysis of human fingerprints. Each fingerprint taken is slightly different, but the identification of key features allows a statistical match with a high degree of certainty.

FIG. 7 is a simplified flow diagram illustrating a method 700 for using a digital fingerprint of a suspect document to detect a potential forgery or alteration associated with the document. First, image data is acquired of a suspect document, block 702. Then the process extracts features from a selected region of the document, block 704. The extracted features are used to form a digital fingerprint, block 706. Next the digital fingerprint or the extracted features are used to form a query, and the query is used to access a forgery/alteration database in search of a matching record, block 708. If a matching record is returned, decision block 710, then the system may report a potential forgery or alteration associated with the suspect document, block 712. Optionally, multiple results may be combined in reaching a conclusion, block 720, where such are available.

Referring again to decision block 710, if no match is returned (i.e. no record matches the query criteria within a selected tolerance or confidence), then the process optionally may be repeated, block 714, for comparison to additional database records. In other words, the database search may be expanded, see loop 718. Again, multiple query results may be combined. Further, the entire process, defined by loop 730, may be repeated for inspecting and analyzing additional or different regions of the document, block 722. As discussed earlier, multiple regions of interest may be defined. Terminal conditions, not shown, may be implemented.

FIG. 10 is an illustration of an example of feature extraction from a digital image. The original image data, on the left, is searched by any of various software processes to detect and locate a feature in the image. In this case, the only important shape in this image is a square, and it extracts that feature, as shown in the middle of the figure, with “1” pixel values. (Most real implementations will have greater than one-bit pixel values.) Then, the extracted fingerprint feature may be stored as a feature vector as illustrated on the right side of the figure. A feature vector is an n-dimensional vector of numerical values that represent the shape.

In this approach, we may store only the features, not the entire image. In fact, after feature extraction the original image can be discarded. This has obvious advantages in terms of reduced storage requirements. Typical algorithms used for extracting features include but are not limited to edge detection, corner detection, blob detection, wavelet features; Gabor, gradient and steerable output filter histograms, scale-invariant feature transformation, active contours, shape contexts and parameterized shapes.

Referring now to FIG. 11, it illustrates essentially the same process for accessing a comparison or candidate image, extracting features from that image, and again storing each of them as a feature vector. Note that the example above is presented solely for the purpose of explanation, and actual implementations will vary. For instance, many shapes can be parameterized and stored even more efficiently. Instead of storing all the pixels along the boundaries, the square in FIG. 10 could actually just be stored as the lower left and upper right corner points ((x1, y1), (x2, y2)). Similarly, a circle could be stored with just the center point and radius. Feature vectors can store a wide variety of n-dimensional representations such as point, lines, polylines, edges, ridges, histograms and many others.

Once the features are extracted from the original image and the candidate image, the features can be compared directly to determine if there is a match. Typical algorithms for comparing features include but are not limited to nearest neighbor, hashing, indexing feature sets with visual vocabularies, support vector machines, multilayer perceptron and random forests and ferns. A comparison of these feature vectors is illustrated in FIG. 12, resulting in a match.

FIG. 13A illustrates an image of the numeral “3” representing a number printed on an “original” or known U.S. dollar bill. This bill may have been fingerprinted, for example, at the time of manufacture or public release, as described herein. As noted below, fingerprint databases of currency and the like may be secured. And such databases preferably exclude raw image data. This image, on the order of about 40× magnification, shows a couple of distinctive features visible to the naked eye.

FIG. 13B illustrates an image of a number printed on a second or unknown U.S. dollar bill. This second bill may be fingerprinted using the same process, and then the resulting digital fingerprints, i.e., the respective fingerprint feature vectors, may be compared as further explained below, to determine whether or not the second bill is in fact the same one as the first bill, even though it may have changed from wear and tear.

FIG. 14A is a simplified illustration of the results of feature extraction applied to the numeral 3 of FIG. 13A. (Only the ends of the numeral are shown.) Two areas of interest are called out by circles 1720 and 1750. Below we discuss how these areas may be selected in an image. Fingerprint feature extraction is applied to each of these circular regions. The results for each location are stored in fingerprint feature vectors. A collection of feature vectors, say for location 1750, may be stored as a feature vector array. FIG. 14B is a simplified illustration of the results of feature extraction applied to the numeral 3 of FIG. 13B. The same fingerprinting process is applied to this image. The same locations of interest as in FIG. 14A are labeled 1720 and 1760, respectively. Then the stored features (from the original object) are compared with the features extracted from the new object. As in this case, if the features are not encountered in the second object, it is not a match.

One of the key advantages to the feature-based method is that when the object is very worn from handling or use, the system can still identify the object as original, which may be impossible with the bitmapped approach. FIG. 15A shows the same dollar bill image as in FIG. 13A, juxtaposed with FIG. 15B for comparison. FIG. 15B shows the same bill after machine washing, perhaps in someone's pocket.

In FIG. 15B, the image (actually the dollar bill) has been degraded; there is significant loss of ink and destruction of the paper surface in multiple locations. A bit mapped approach would clearly fail to match up here, as the number of pixels that are different is significant—only relatively few of the pixels are the same as the original.

FIG. 16A shows the detail of two fingerprint feature locations as before, 1610 and 1650. FIG. 16B shows detail of the damaged bill with the corresponding locations called out as 1620 and 1660, respectively. Here, one can see visually why a comparison of the corresponding fingerprint feature vectors would be adequate to result in a match. In practice, a much larger number of features would be used.

The image of the damaged bill is analyzed by a processor. The processor accesses a database of previously stored fingerprint data. If the dollar bill serial number is legible (by eye or machine), the record for the corresponding bill may be accessed from the datastore using the serial number as an index. Similarly, if any portion of the serial number is legible, the search for a matching record can be narrowed on that basis. Either way, a candidate record, containing a set of stored regions of interest may be compared to the suspect image.

As explained above, in addition to being able to recognize a worn object, the feature-based approach can deal with problems like rotated images. This is especially important in a system where the retail customer may be taking a picture of an object to be authenticated. In this case external factors like lighting and rotation are not under the manufacturer's control.

Referring now to FIG. 17, it shows the original image on the left side, with a small set of fingerprint features marked as small diamond shapes. This is merely a callout symbol for illustration. In a preferred implementation, as noted, circular areas are used. For each feature (preferably identified in the database record), a search is conducted of the suspect image on the right side of FIG. 17 for a matching feature. The position may not match exactly, due to “stretch”—an effective difference in magnification, and/or due to rotation of the image. Although it may not match locations literally; a mathematical transformation can be defined that maps one image to the other, thereby accounting for rotation and stretch as appropriate. Thus, a bounding rectangle A indicated by the box in the left side image may be mapped to a quadrilateral indicated by the line B in the right side image.

Once an appropriate transformation is found, further matching can be done to increase the level of confidence of the match if desired. In some applications, a number of matches on the order of tens or hundreds of match points is sufficient. On the other hand, the number of non-match points also should be taken into account. That number should be relatively very low, but it may be non-zero due to random dirt, system “noise” and the like. Preferably, the allowed mapping or transformation should be restricted depending on the type of objects under inspection. For instance, some objects may be inflexible, which may restrict the possible deformations of the object.

To summarize the imaging requirements for a typical fingerprinting system, for example for inspecting documents, it should provide sufficient imaging capability to show invariant features. The particulars will depend on the regions used for authentication. For many applications, 10x magnification is adequate. For ink bleeds on passports, bills and other high-value authentication, 40x power is more than sufficient. In preferred embodiments, the software should implement a flexible response to accommodate misalignment (rotation), orientation and scale changes. Color imaging and analysis is generally not required for using the processes described above.

Hardware and Software

Most of the equipment discussed above comprises hardware and associated software. For example, the typical portable device is likely to include one or more processors and software executable on those processors to carry out the operations described. We use the term software herein in its commonly understood sense to refer to programs or routines (subroutines, objects, plug-ins, etc.), as well as data, usable by a machine or processor. As is well known, computer programs generally comprise instructions that are stored in machine-readable or computer-readable storage media. Some embodiments of the present invention may include executable programs or instructions that are stored in machine-readable or computer-readable storage media, such as a digital memory. We do not imply that a “computer” in the conventional sense is required in any particular embodiment. For example, various processors, embedded or otherwise, may be used in equipment such as the components described herein.

Memory for storing software again is well known. In some embodiments, memory associated with a given processor may be stored in the same physical device as the processor (“on-board” memory); for example, RAM or FLASH memory disposed within an integrated circuit microprocessor or the like. In other examples, the memory comprises an independent device, such as an external disk drive, storage array, or portable FLASH key fob. In such cases, the memory becomes “associated” with the digital processor when the two are operatively coupled together, or in communication with each other, for example by an I/O port, network connection, etc. such that the processor can read a file stored on the memory. Associated memory may be “read only” by design (ROM) or by virtue of permission settings, or not. Other examples include but are not limited to WORM, EPROM, EEPROM, FLASH, etc. Those technologies often are implemented in solid state semiconductor devices. Other memories may comprise moving parts, such as a conventional rotating disk drive. All such memories are “machine readable” or “computer-readable” and may be used to store executable instructions for implementing the functions described herein.

A “software product” refers to a memory device in which a series of executable instructions are stored in a machine-readable form so that a suitable machine or processor, with appropriate access to the software product, can execute the instructions to carry out a process implemented by the instructions. Software products are sometimes used to distribute software. Any type of machine-readable memory, including without limitation those summarized above, may be used to make a software product. That said, it is also known that software can be distributed via electronic transmission (“download”), in which case there typically will be a corresponding software product at the transmitting end of the transmission, or the receiving end, or both.

Integration with Bill Processing Equipment

We propose creation of fingerprint data at the U.S. Treasury or any other producer (printer) of negotiable bills or notes. Preferably, such a system utilizes random, microscopic features unique to each bill's paper and printing. For example, the system may extract features from unpublished locations on the bills. In other words, the specific locations used for authentication are maintained in secrecy. The extracted features may be converted into encrypted feature vectors and associated in a data store with the corresponding bill serial number (the serial number having been readily captured by the same scanner or a separate one). In this way, a protected database may be created that is addressable or searchable by serial number or feature vector, but only by authorized users. (Here, a “user” may be a machine with electronic access to the database.)

Equipment is known for stacking, counting, and “strapping” paper money notes. A “strap” is a package of 100 notes, held together by a single paper band, as required for deposit by U.S. Federal Reserve rules. Various note handling equipment may be modified to include a digital scanner, for example, an optical scanner, to capture images of each bill or note as it is processed. The scanner may be coupled to a suitable processor, as explained above, for storing the captured images, and for processing the images to authenticate them and/or to detect counterfeit items.

Preferably, such a system is granted access to the protected database that is searchable by serial number or digital fingerprint. It may then look up each bill scanned, and compare features of the digital image to the digital fingerprint stored in protected database for the corresponding serial number. This process may be done by batch or in real time or near real time. The comparison, as further described above, may provide a confidence metric, or a simple yes/no (authentic/counterfeit) result for each note. It may identify a counterfeit note by serial number, but also by sequence number to facilitate locating the bill (“the 28^(th) bill in the strap #218”). In this way, a bank or other institution can detect counterfeit notes in a timely manner.

In another embodiment, a scanner, which may be portable and optionally wireless, may be made available at a bank teller station for the teller to authenticate bills presented for deposit or exchange. Further, such a system may be installed at an ATM machine to automatically authenticate bills presented for deposit. The ATM may be programmed to accept the bills to get them “off the street” but flag them as counterfeit or suspect.

The term “note” is commonly used in the U.K. with regard to paper money, while the term “bill” is more common in the U.S. We use them interchangeably here. Not to be confused with U.S. Treasury “bills” and “notes” which are not currency but debt instruments. That said, the inventions disclosed herein are applicable to those as well as currency, although nowadays such things are mainly processed by electronic “book entries” rather than paper documents. Older U.S. Savings Bonds, and any other bearer instruments in any country, can all be authenticated by various embodiments of the present invention.

Having described and illustrated the principles of the disclosure and some illustrative embodiments thereof, it should be apparent that the invention may be modified in arrangement and detail without departing from such principles. For convenience, we summarize below some aspects of the disclosure. The following list is merely illustrative and not intended to limit or define all the inventions disclosed. The scope of the present invention should, therefore, be determined only by the following claims. 

1. A method, comprising: acquiring a first image of a first physical object; extracting first image data from the first image that describes one or more optically discernible features of the first physical object; generating a first digital fingerprint of the first physical object based on the extracted first image data; storing the first digital fingerprint in a database, wherein the database stores a plurality of digital fingerprints associated with a plurality of physical objects; acquiring a second image of a second physical object; extracting second image data from the second image that describes one or more optically discernible features of the second physical object; generating a second digital fingerprint of the second physical object based on the extracted second image data; comparing the second digital fingerprint with at least some of the plurality of digital fingerprints in the database to obtain a result; and executing a predetermined action based on the result.
 2. The method of claim 1, wherein generating the first digital fingerprint includes: accessing a predefined pattern associated with a category of physical objects to which the first physical object belongs, the predefined pattern including plural elements; parsing the first image data to extract element values corresponding to the plural elements of the predefined pattern; and generating the first digital fingerprint further based, at least in part, on one or more of the extracted element values.
 3. The method of claim 2, wherein a first individual element value for at least one of the plural elements includes a number of pixels within a portion of the first image data.
 4. The method of claim 2, wherein the comparing comprises: comparing pixel data that represents a concentration of pixels and surrounding white areas associated with the second image to corresponding pixel data that represents concentrations of pixels and surrounding white areas in images associated with at least some of the plurality of digital fingerprints in the database.
 5. The method of claim 2, wherein the comparing comprises: comparing a spatial relationship between one or more individual elements of the plural elements of the predefined pattern.
 6. The method of claim 2, wherein the result reflects a match between the first physical object and the second physical object that is definitive within a predetermined range or that is above a determined probability or both.
 7. The method of claim 1, further comprising: mapping a location of visible pixels of the second image, and wherein comparing the second digital fingerprint includes comparing mapped locations with mapped location data corresponding to the at least some of the plurality of digital fingerprints.
 8. The method of claim 1, wherein the second image comprises: a handwritten portion associated with a mailing address, and wherein comparing the second digital fingerprint includes analyzing the handwritten portion to determine a number of transitions from black to white across the handwritten portion.
 9. A machine comprising: an input device to capture image data of at least a first portion of a suspect physical object; a digital fingerprinting software component that, when executed, creates a first digital fingerprint of the suspect physical object based on indicia interpreted in the image data; and a software interface component that, when executed, stores the first digital fingerprint in a database, the database having stored therein a plurality of digital fingerprints associated with a plurality of physical objects, wherein the first digital fingerprint identifies the suspect physical object as being unique among the plurality of objects.
 10. The machine of claim 9, further comprising: an electro-mechanical transport configured to convey the suspect physical object into operational view of the input device, wherein the input device includes at least one optical sensor.
 11. The machine of claim 9, wherein the indicia interpreted in the image data includes dimensional qualities associated with at least one physical feature of the suspect physical object.
 12. The machine of claim 9, wherein the indicia interpreted in the image data includes non-alpha-numeric, optically discernible features of the suspect physical object.
 13. The machine of claim 9, further comprising: a parser software component that, when executed, parses the image data into at least one region of interest, wherein the digital fingerprinting software component, when executed, further: collects region-of-interest data from each region of interest of the image data; generates identifier data for each region of interest, the identifier data pairing an identifier with region-of-interest data of a corresponding region of interest; and associates the identifier data with the first digital fingerprint of the suspect physical object.
 14. The machine of claim 13, wherein the database includes at least one list of classification elements and individual element values for each of the classification elements, wherein each classification element and its corresponding element values together define a corresponding pattern, and wherein the database is arranged to support searching a plurality of patterns for a match based on identifier data associated with digital fingerprints stored in the database.
 15. The machine of claim 9, wherein the first digital fingerprint is based exclusively on image data of the first portion of the suspect physical object and not based on identifying information about the suspect physical object.
 16. The machine of claim 9, wherein the digital fingerprinting software component, when executed, further: extracts information from the image data; categorizes the extracted information to determine a category; and includes a category indicator in the first digital fingerprint.
 17. The machine of claim 9, wherein digital fingerprint data of the first digital fingerprint is responsive to one or more characteristics appearing in a presentable image generated from the image data, the one or more characteristics including a plurality of markings discernible in the presentable image together with respective location data of each of the plurality of markings.
 18. The machine of claim 17, wherein the digital fingerprint data includes size data for each of the plurality of markings.
 19. The machine of claim 17, wherein the digital fingerprint data includes dimension data for each of the plurality of markings, the dimension data defined, at least in part, by corresponding dimensions of each respective marking.
 20. The machine of claim 17, wherein the digital fingerprint data includes pixel-count data for each of the plurality of markings, the pixel-count data defined, at least in part, by a total pixel count of each respective marking.
 21. A non-transitory computer-readable storage medium whose stored contents configure a computing system to perform a method, the method comprising: acquiring a first image, the first image being an image at least a portion of a first physical object; isolating image data associated with a first portion of the first physical object; creating a first digital fingerprint of the first physical object based on indicia interpreted in the image data; and storing the first digital fingerprint in a database, the database having stored therein a plurality of digital fingerprints associated with a plurality of physical objects, wherein the first digital fingerprint identifies the first physical object as being unique among the plurality of objects.
 22. The non-transitory computer-readable storage medium according to claim 21 whose stored contents configure the computing system to perform the method, the method further comprising: acquiring a second image, the second image being an image at least a portion of a suspect physical object; isolating second image data associated with the second image that describes one or more optically discernible features of the suspect physical object; generating a second digital fingerprint of the suspect physical object based on the isolated second image data; comparing the second digital fingerprint with at least some of the plurality of digital fingerprints in the database to obtain a result; and executing a predetermined action based on the result.
 23. The non-transitory computer-readable storage medium according to claim 22 whose stored contents configure the computing system to perform the method, wherein the result represents at least a threshold probability that the suspect physical object is an identified one of the plurality of physical objects whose digital fingerprint is stored in the database. 